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ABSTRACT 

The various technologies of data mining (DM) models 
for forecast of heart disease are discussed. Data 
mining plays an important role in building an 
intelligent model for medical systems to detect heart 
disease (HD) using data sets of Angioplasty and 
Stents patients, which involves risk factor associated 
with heart disease. Medical practitioners can help the 
patients by predicting the heart disease before 
occurring. The large data available from medical 
diagnosis is analyzed by using data mining tools and 
useful information known as knowledge is extracted. 
Mining is a method of exploring massive sets of data 
to take out patterns which are hidden and previously 
unknown relationships and knowledge detection to 
help the better understanding of medical data to 
prevent heart disease. There are many DM techniques 
available namely Classification techniques involving 
Naive bayes (NB), Decision tree (DT), Neural 
network (NN), Genetic algorithm (GA), Artificial 
intelligence (AI) and Clustering algorithms like KNN, 
and Support vector machine (SVM). Several studies 
have been carried out for developing prediction model 
using individual technique and also by combining two 
or more techniques. This research provides a quick 
and easy review and understanding of available 
prediction models using data mining. The comparison 
shows the accuracy level of each model given by 
different researchers. 
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TRENDS IN USING DATAMINING 
TECHNIQUES IN HEART DISEASE 

Although applying data mining is beneficial to 
healthcare, disease diagnosis, and treatment, few 
researches have investigated producing treatment 


plans for patients. Accurate diagnosis and treatment 
given to patients have been major issues highlighted 
in medical services. Recently, researchers started 
investigating using data mining techniques to handle 
the error and complexity of treatment processes for 
healthcare providers. Razali and Ali investigated 
generating treatment plans for acute upper respiratory 
infection disease patients using a decision tree. The 
model recommended treatment through giving drugs 
to patients showing accuracy of 94.73%. Applying 
association rules and decision tree to treatment plans 
are showing acceptable performance. However, the 
comparison with other data mining techniques such as 
naive bayes, neural network, and genetic algorithms 
still needs investigation. 

DATA MINING IN PREDICTION OF HEART 
DISEASE 

Despite the fact that data mining has been around for 
more than one decade, its potential is just been 
realized now. Data mining join factual examination, 
machine learning and database engineering to extract 
hidden patterns and connections from substantial 
databases Data mining mainly uses two 
methodologies: 

supervised and unsupervised learning’s. In supervised 
learning, a training set is utilized to learn model 
parameters though in unsupervised learning no 
training set is utilized like in K-Means clustering. The 
two most normal modeling goals of data mining are 
classification and prediction. Classification models 
classify discrete unordered values or data whereas 
prediction models predicts about continuous valued. 
Decision trees and Neural Networks are examples of 
classification models while Regression, Association 
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Rules and Clustering are examples of prediction 
algorithm. 

In this prediction of heart disease, we will use the 
following classification models of data mining are 
analyzed: 

A. Decision trees 

B. Neural networks 

C. Naive Bayes Classifier 

PROPOSED SYSTEM 

To enhance the prediction of classifiers, genetic 
search is integrated; the genetic search resulted in 13 
attributes which contributes more towards the 
prediction of the cardiac disease. The classifiers 
ensemble algorithms such as Random Forest are used 
for prediction of patients with heart disease. The 
classifiers are fed with reduced data set with 13 
attributes. Results are shown in. Observations exhibit 
that the Proposed classification technique outperforms 
other data mining techniques such as Narve Bayes and 
Decision Tree after incorporating feature subset 
selection but with high model construction time. 
Proposed classification technique performs 
consistently before and after reduction of attributes 
with the same model construction time. 

The proposed system will have the following features: 

> Collection of health care data of heart disease. 

> Storage and Processing of data using Weka 
V Applying Proposed classification algorithm. 

> Analyzing performance in terms of accuracy and 
error rate. 

BJECTIVES OF THE PROPOSED STUDY 

The purpose of this research work to increase the 
efficiency of improved classification technique. The 
main purpose of this goal is to increase the efficiency 
and accuracy of the decision tree dataset. 


1. To study the attribute of angioplasty and stent 
patients from various hospitals. 

2. To refine the dataset for improving accuracy, 
correctly classified instance and reduce error rate. 

3. To apply traditional and proposed algorithm on 
mixed dataset. 

4. To analyze the computation parameters for 

improving efficiency of data set. 

5. To critically analyze the diagnosis for angioplasty 
and stents patients using a improved classification 
technique. 

6. To suggest self-programmed algorithm for 

improving efficiency of decision tree technique. 

This research work focuses on above stated objectives 
which aim in increase the efficiency on correctly 
classified instances and reducing error rate. 

Improved Classification Algorithm 

The Improved Classification Algorithm is explained 
as below: 

Input: Dataset 

Output: Predicted class label 

set number of classes to N, number of features to M 

set /n determine the number of features at a node of 

decision 

tree, (m < M ) 

for each decision tree do 

Select randomly, a subset (with replacement) of 

training data that represents the N classes and use the 

rest of data to measure the error of the tree 

for each node of this tree do 

Select randomly: m features to determine the 

decision at the 

node and calculate the best split accordingly. 

end for 
end for 


Results 



Fig 1: The bar graph shows all attributes 
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Fig 2 Shows the values of CCI, ICI and Error Rate of OneR, ZeroR, Naive Bayes, and ICA Algorithms 
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Fig 3: Shows the values of TP Rate of OneR, ZeroR, Naive Bayes, and ICA Algorithms 


CONCLUSION 

Motivated by the world-wide increasing mortality of 
heart disease patients each year and the availability of 
huge amounts of data, researchers are using data 
mining techniques in the diagnosis of heart disease. 
Although applying data mining techniques to help 
health care professionals in the diagnosis of heart 
disease is having some success, the use of data mining 
techniques to identify a suitable treatment for heart 
disease patients has received less attention. Also, 
applying hybrid data mining techniques has shown 
promising results in the diagnosis of heart disease, so 
applying improved data mining technique in selecting 
the suitable treatment for heart disease patients needs 
further investigation. This research identifies gaps in 
the research on heart disease diagnosis and treatment 
and proposes a model to systematically close those 


gaps to discover if applying data mining techniques to 
heart disease treatment data can provide as reliable 
performance as that achieved in diagnosing heart 
disease patients. 

Improved Classification Algorithm (ICA) shows the 
better results in correctly classified instances, 
incorrect classified instances, error rate, TP Rate, FP 
Rate, Precision, Recall and F-Measure Values. 
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